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Abstract 

Most current planners assume complete domain models 
and focus on generating correct plans. Unfortunately, 
domain modeling is a laborious and error-prone task. 
While domain experts cannot guarantee completeness, 
often they are able to circumscribe the incompleteness 
of the model by providing annotations as to which parts 
of the domain model may be incomplete. In such cases, 
the goal should be to generate plans that are robust with 
respect to any known incompleteness of the domain. In 
this paper, we first introduce annotations expressing the 
knowledge of the domain incompleteness, and formal- 
ize the notion of plan robustness with respect to an in- 
complete domain model. We then propose an approach 
to compiling the problem of finding robust plans to the 
conformant probabilistic planning problem. We present 
experimental results with Probabilistic-FF, a state-of- 
the-art planner, showing the promise of our approach. 



1. Introduction 

In the past several years, significant strides have been 
made in scaling up plan synthesis techniques. We now 
have technology to routinely generate plans with hun- 
dreds of actions. All this work, however, makes a 
crucial assumption-that a complete model of the do- 
main is specified in advance. While there are do- 
mains where knowledge-engineering such detailed mod- 
els is necessary and feasible (e.g., mission planning do- 
mains in NASA and factory-floor planning), it is increas- 
ingly recognized (c.f. ( [Hoffmann, Weber, and Kraft 2010[ 
pCambhampati 2007 1) that there are also many scenarios 
where insistence on correct and complete models renders 
the current planning technology unusable. What we need 
to handle such cases is a planning technology that can get 
by with partially specified domain models, and yet generate 
plans that are "robust" in the sense that they are likely to 
execute successfully in the real world. 

This paper addresses the problem of formalizing the 
notion of plan robustness with respect to an incomplete 
domain model, and connects the problem of generating 
a robust plan under such model to conformant prob- 
abilistic planning ([K ushmeiick, Hanks, and Weld 19951 



Hyafil and Bacchus 2003; Bryce, Kambhampati, and Smith 2006 
Domshlak and Hoffmann 2007E Following Garland & 



Lesh (I2OO2I 1. we shall assume that although the domain 
modelers cannot provide complete models, often they 
are able to provide annotations on the paitial model cir- 
cumscribing the places where it is incomplete. In our 
framework, these annotations consist of allowing actions to 
have possible preconditions and effects (in addition to the 
standard necessary preconditions and effects). 

As an example, consider a variation of the Gripper do- 
main, a well-known planning benchmark domain. The robot 
has one gripper that can be used to pick up balls, which are 
of two types light and heavy, from one room and move them 
to another room. The modeler suspects that the gripper may 
have an internal problem, but this cannot be confirmed until 
the robot actually executes the plan. If it actually has the 
problem, the execution of the pick-up action succeeds only 
with balls that are not heavy, but if it has no problem, it can 
always pickup all types of balls. The modeler can express 
this partial knowledge about the domain by annotating the 
action with a statement representing the possible precondi- 
tion that balls should be light. 

Incomplete domain models with such possible precondi- 
tions/effects implicitly define an exponential set of complete 
domain models, with the semantics that the real domain 
model is guaranteed to be one of these. The robustness of a 
plan can now be formalized in terms of the cumulative prob- 
ability mass of the complete domain models under which it 
succeeds. We propose an approach that compiles the prob- 
lem of finding robust plans into the conformant probabilistic 
planning problem. We present experimental results showing 
scenarios where the approach works well, and also discuss 
aspects of the compilation that cause scalability issues. 

2. Related Work 

Although there has been some work on reducing the "faults" 
in plan execution (e.g. the work on k-fault plans for non- 
deterministic planning (I Jensen, Veloso, and Bryant 2004[ l), 
it is based in the context of stochastic/non-deterministic ac- 
tions rather than incompletely specified ones. The seman- 
tics of the possible preconditions/effects in our incomplete 
domain models differ fundamentally from non-deterministic 
and stochastic effects. Executing different instances of the 
same pick-up action in the Gripper example above would 
either all fail or all succeed, since there is no uncertainty but 
the information is unknown at the time the model is built. In 



contrast, if the pick-up action's effects are stochastic, then 
trying the same picking action muhiple times increases the 
chances of success. 

Garland & Lesh (2002) share the same objective with us 
on generating robust plans under incomplete domain mod- 
els. However, their notion of robustness, which is defined 
in terms of four different types of risks, only has tenuous 
heuristic connections with the likelihood of successful ex- 
ecution of plans. Robertson & Bryce ( |2009t focuses on 
the plan generation in Garland & Lesh model, but their ap- 
proach still relies on the same unsatisfactory formulation of 
robustness. The work by Fox et al (2006 ) also explores ro- 
bustness of plans, but their focus is on temporal plans under 
unforeseen execution-time variations rather than on incom- 
pletely specified domains. Our work can also be categorized 
as one particular instance of the general model-lite planning 
problem, as defined in ( [Kambhamp ati 2007), in which the 
author points out a large class of applications where han- 
dling incomplete models is unavoidable due to the difficulty 
in getting a complete model. 

3. Problem Formulation 

We define an incomplete domain model T) as D — {F, A), 
where F = {pi , p2 7 ■ • ■ , } is a set of propositions, A is a set 
of actions that might be incompletely specified. We denote 
T and F as the true and false truth values of propositions. 
A state s C F is a set of propositions. In addition to propo- 
sition sets that are known as its preconditions Pre{a) C F, 
add effects Add{a) C F and delete effects Del{a) C F, 
each action a G A also contains: 

• Possible precondition set Pre{a) C F contains proposi- 
tions that action a might need as its preconditions. 

• Possible add (delete) effect set Add{a) C F (Del{a) C 
F) contains propositions that the action a might add 
(delete, respectively) after its execution. 

In addition, each possible precondition, add and delete ef- 
fect p of the action a is associated with a weight wP^'^{p), 
wf'^{p) andwf (p) (0 < wP;"'{p),wf'^{p),wf^{p) < 1) 
representing the domain modeler's assessment of the likeli- 
hood that p will actually be realized as a precondition, add 
and delete effect of a (respectively) during plan execution. 
Possible preconditions and effects whose likelihood of real- 
ization is not given are assumed to have weights of i. 

Given an incomplete domain model V, we define its 
completion set {{V)) as the set of complete domain mod- 
els whose actions have all the necessary preconditions, 
adds and deletes, and a subset of the possible precondi- 
tions, possible adds and possible deletes. Since any sub- 
set of Pre{a), Add{a) and Del{a) can be realized as pre- 
conditions and effects of action a, there are exponentially 
large number of possible complete domain models T>i e 

m = {Vi,V2, V^k}, where K = EaGA(l^e(a)| + 

|Ad(i(a)| + \Del{a)\). For each complete model Vi, we de- 
note the corresponding sets of realized preconditions and ef- 
fects for each action a as Prei{a), Addi{a) and Deli{a); 
equivalently, its complete sets of preconditions and effects 



are Pre[a) U Prei{a), Add{a) U Addi{a) and Del{a) U 
Deli{a). 

The projection of a sequence of actions tt from an initial 
state / according to an incomplete domain model V is de- 
fined in terms of the projections of tt from / according to 
each complete domain model Pj G ((I?)): 

7(71, y ^{^,i,Vi) (1) 

X).e{{D)) 

where the projection over complete models is defined in the 
usual STRIPS way, with one important difference. The re- 
sult of applying an action a in a state s where the precondi- 
tions of a are not satisfied is taken to be s (rather than as an 
undefined state) Q 

A planning problem with incomplete domain is P = 
(P, /, G) where / C F is the set of propositions that are 
true in the initial state, and G is the set of goal proposi- 
tions. An action sequence tt is considered a valid plan for 
Vifn solves the problem in at least one completion of ((2?)). 
Specifically, 3j,^g^^5^^7(7r, J, P,) ^ G. 

Modeling Issues in Annotating Incompleteness: From the 
modeling point of view, the possible precondition and effect 
sets can be modeled at either the grounded action or action 
schema level (and thus applicable to all grounded actions 
sharing the same action schema). From a practical point of 
view, however, incompleteness annotations at ground level 
hugely increase the burden on the domain modeler. To offer 
a flexible way in modeling the domain incompleteness, we 
allow annotations that are restricted to either specific vari- 
ables or value assignment to variables of an action schema. 
In particular: 

• Restriction on value assignment to variables: Given 
variables Xi with domains Xi, one can indicate 
that p{xi^, ...,Xi^) is a possible precondition/effect of 
an action schema a(xi,...,x„) when some variables 
Xj-^,...,Xji have values Cj^ G Xjj,...,Cj, G Xj^ 
({ii, ...,ifc}, {ji, ji} C {!,..., n}). Those possible 
preconditions/effects can be specified with the annotation 
p{xi-^, ...,Xi^) : when {xj^ ~ ci A ... A Xj^ = c;) for 
the action schema a{xi, Xn)- More generally, we al- 
low the domain writer to express a constraint C on the 
variables Xj-^, xj^ in the : when construct. The annota- 
tion ^(a;^^, 2;^^,) : when (C) means that p(cij ) 
is a possible precondition/effect of an instantiated action 
a{ci, Cn) (ci G Xi) if and only if the assignment 
{xj-^ := Cjj , Xj, := Cj, ) satisfies the constraint C. This 
syntax subsumes both the annotations at the ground level 
when / = n, and at the schema level if / = (or the 
: when construct is not specified). 

• Restriction on variables: Instead of constraints on ex- 
plicit values of variables, we also allow the possible 
preconditions/effects p{xi^, ...,Xi^) of an action schema 

'We shall see that this change is necessary so that we can talk 
about increasing the robustness of a plan by adding additional ac- 
tions. 



a{xi, Xn) to be dependent on some specific variables 
Xj-^ , Xj, without any knowledge of their restricted val- 
ues. This annotation essentially requires less amount 
of knowledge of the domain incompleteness from the 
domain writer Semantically, the possible precondi- 
tion/effect ^(xi^ x^^.) : depends (ccj J a; j, ) of an 
action schema a(xi, ...,x„) means that (1) there is at least 
one instantiated action a(ci,...,c„) {ci G Xi) having 
p(cij , Cij.) as its precondition, and (2) for any two as- 
signments (a;i ci,...,a;„ := c„),(xi := ci,...,x„ 
c'„) such that Cj^ = c'j^ (1 < i < 0^ either both 
p{ci-^, ■■■,Ci^) and p{c[_^, •■•'"^ifc) ^^"^ preconditions of the 
corresponding actions, or they are not. Similar to the 
: when above, the : depend construct also subsumes the 
annotations at the ground level when I = n, and at the 
schema level if / = (or the : depend field is not speci- 
fied). 

Another interesting modeling issue is the correlation 
among the possible preconditions and effects across actions. 
In particular, the domain writer might want to say that two 
actions (or action schemas) will have specific possible pre- 
conditions and effects in tandem. For example, we might say 
that the second action will have a particular possible precon- 
dition whenever the first one has a particular possible effect. 
We note that annotations at the lifted level introduce correla- 
tions among possible preconditions and effects at the ground 
level. 

Although our notion of plan robustness and approach to 
generating robust plans (see below) can be adapted to allow 
such flexible annotations and correlated incompleteness, for 
ease of exposition we limit our discussion to uncorrelated 
possible precondition and effect annotations specified at the 
schema level (i.e. without using the : when and : depend 
constructs). 

4. A Robustness Measure for Plans 

Given an incomplete domain planning problem V = 
{V, I, G), a valid plan (by our definition above) need only 
to succeed in at least one completion of T). Given that {{V)) 
can be exponentially large in terms of possible preconditions 
and effects, validity is too weak to guarantee on the quality 
of the plan. What we need is a notion that tt succeeds in 
most of the highly likely completions of T). We do this in 
terms of a robustness measure. _ 

The robustness of a plan tt for the problem V is defined 
as the cumulative probability mass of the completions of V 
under which vr succeeds (in achieving the goals). More for- 
mally, let Pr(2?i) be the probability distribution represent- 
ing the modeler's estimate of the probability that a given 
model in ((2?}) is the real model of the world (such that 
'^■D-e{{T>)) — !)■ The robustness of tt is defined 

as follows: 

R{7r,P : {■D,I,G))%^ ^ Pr(A) (2) 

I'.e((C)),7(7r,/,C,)hG 

It is easy to see that if R{tt, V) > 0, then tt is a valid plan 
for:P. 
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Figure 1 : An example of different complete domain models, 
and the corresponding plan status. Circles with solid and 
dash boundary are propositions that are known to be T and 
may be F (respectively) when the plan executes. (See text.) 

Note that given the uncorrelated incompleteness assump- 
tion, the probability Pr(2?i) for a model Vi G {{V)) can be 
computed as the product of the weights w^'^{p), w1'^'^{p), 
and w'l'^^{p) for all a G A and its possible precondi- 
tions/effects p if p is realized in the model (or the prod- 
uct of their "complement" 1 — wP^'^{p), 1 — w'^'^'^{p), and 
1 — wf'^' (p) if p is not realized). 

Example: Figure [1] shows an example with an incomplete 

domain model V ~ {F,A) with F = {^1,^2, Pa} and 
A — {fli , 02} and a solution plan tt = (oi , 02) for the prob- 
lem V = {T>,I = {P2}, G = {ps})- The incomplete model 
is^Pre(ai) = 0, Fre(ai) = {pi}, Add{ai) = {p2,P3}, 
Add{aiy^ 0, Del{ai) = 0, Del{ai) Pre{a2) = 
{P2}, Pfe{a2l^ 0, Add{a2) = 0, Add{a2) = {ps}, 

Del{a2) = 0, Del{a2) — {pi}- Given that the total number 
of possible preconditions and effects is 3, the total number 

of completions ( | {{V)) |) is 2^^ = 8, for each of which the plan 
TT may succeed or fail to achieve G, as shown in the table. 
The robustness value of the plan is R{tt) = | if Pr(2?i) 
is the uniform distribution. However, if the domain writer 
thinks that pi is very likely to be a precondition of ai and 
provides w'j^^'^{pi) = 0.9, the robustness of tt decreases to 
i?(7r) = 2 X '(0.9 X 0.5 x 0.5) +4 x (0.1 x 0.5 x 0.5) = 0.55 
(as intutively, the last four models with which tt succeeds 
are very unlikely to be the real one). Note that under the 
STRIPS model where action failure causes plan failure, the 
plan TT would considered failing to achieve G in the first two 
complete models, since 02 is prevented from execution. 

4.1 A Spectrum of Robust Planning Problems 

Given this set up, we can now talk about a spectrum of prob- 
lems related to planning under incomplete domain models: 

Robustness Assessment (RA): Given a plan tt for the 
problem V, assess the robustness of n. 



Maximally Robust Plan Generation (RG*): Given a 
problem V, generate the maximally robust plan vr*. 

Generating Plan with Desired Level of Robustness (RG''): 
Given a problem V and a robustness threshold p 
(0 < p < 1), generate a plan tt with robustness greater 
than or equal to p. 

Cost-sensitive Robust Plan Generation (RG*): Given 
a problem V and a cost bound c, generate a plan tt of 
maximal robustness subject to cost bound c (where the 
cost of a plan tt is defined as the cumulative costs of the 
actions in tt). 

Incremental Robustification (RIc): Given a plan tt for the 
problem V, improve the robustness of vr, subject to a cost 
budget c. 

The problem of assessing robustness of plans, RA, can 
be tackled by compiling it into a weighted model-counting 
problem. For plan synthesis problems, we can talk about 
either generating a maximally robust plan, RG*, or finding a 
plan with a robustness value above the given threshold, RG''. 
A related issue is that of the interaction between plan cost 
and robustness. Often, increasing robustness involves using 
additional (or costlier) actions to support the desired goals, 
and thus comes at the expense of increased plan cost. We 
can also talk about cost-constrained robust plan generation 
problem RG*. Finally, in practice, we are often interested 
in increasing the robustness of a given plan (either during 
iterative search, or during mixed-initiative planning). We 
thus also have the incremental variant RIc. 

In this paper, we will focus on RG'', the problem of syn- 
thesizing plan with at least a robustness value of p. 

5. Compilation to Conformant Probabilistic 
Planning 

In this section, we will show that the problem of generating 
plan with at least p robustness, RG'', can be compiled into an 
equivalent conformant probabilistic planning problem. The 
most robust plan can then be found with a sequence of in- 
creasing threshold values. 

5.1 Conformant Probabilistic Planning 

Following the formalism in 

(IDomshlak and Hoffmann 2007). a domain in confor- 
mant probabilistic planning (GPP) is a tuple V = {F' ,A'), 
where F' and A' are the sets of propositions and proba- 
bilistic actions, respectively. A belief state 6 : 2^ — [0, 1] 
is a distribution of states s C F' (we denote s e 6 if 
b{s) > 0). Each action a' G A' is specified by a set of 
preconditions Pre{a') C F' and conditional effects E{a'). 
For each e = {cons{e),0{e)) e E{a'), cons{e) C F' 
is the condition set and 0{e) determines the set of out- 
comes e = {Pr{e),add{e),del{e)) that will add and 
delete proposition sets add{e), del{e) into and from the 
resulting state with the probability Pr{e) (0 < Pr{e) < 1 
, J2eeo{e) Pfi^) — !)■ All condition sets of the effects in 
E{a') are assumed to be mutually exclusive and exhaustive. 
The action a' is applicable in a belief state b if Pre{a') C s 



for all s G b, and the probability of a state s' in the resulting 

belief state is ba'{s') = EoPre(a') Ks) Ee60'(e) ^^i^l 

where e e E{a') is the conditional effect such that 
cons{e) C s, and C(e) C 0{e) is the set of outcomes e 
such that s' — sL) add{e) \ del{e). 

Given the domain V, a problem V' is a quadruple V = 
{V , bj, G' , p'), where bj is an initial belief state, G" is a set 
of goal propositions and p' is the acceptable goal satisfaction 
probability. A sequence of actions tt' = {a[, a',^) is a 
solution plan for V' if a[ is applicable in the belief state bi 
(assuming bi = bj), which results in (1 < i < n), and 
it achieves all goal propositions with at least p' probability. 

5.2 Compilation 

Given an incomplete domain model V — {F, A) and a plan- 
ning problem V = {V, I, G), we now describe a compila- 
tion that translates the problem of synthesizing a solution 
plan TT for V such that R{tt, V) > p to a CPP problem 
V' ■ At a high level, the realization of possible precondi- 
tions p e Pre{a) and effects q G Add{a), r G Del (a) 
of an action a G A can be understood as being determined 
by the truth values of hidden propositions p^^'^, q"'^'' and 
rjj"^' that are certain (i.e. unchanged in any world state) but 
unknown. Specifically, the applicability of the action in a 
state s C F depends on possible preconditions p that are 
realized (i.e. pP^^ — T), and their truth values in s. Simi- 
larly, the values of q and r are affected by a in the resulting 
state only if they are realized as add and delete effects of 
the action (le., q^'^'^ T, r^"'^ = T). There are totally 
2\i^e{a)\+\Add{a)\+\D^i{a)\ realizations of the action a, and 
all of them should be considered simultaneously in checking 
the applicability of the action and in defining corresponding 
resulting states. 

With those observations, we use multiple conditional ef- 
fects to compile away incomplete knowledge on precondi- 
tions and effects of the action a. Each conditional effect 
corresponds to one realization of the action, and can be fired 
only if p — T whenever pP^'^ — T, and adding (removing) 
an effect q (r) into (from) the resulting state depending on 
the values of g^'^'' (r^^\ respectively) in the realization. 

While the partial knowledge can be removed, the hidden 
propositions introduce uncertainty into the initial state, and 
therefore making it a belief state. Since the action a may 
be applicable in some but rarely all states of a belief state, 
certain preconditions Pre{a) should be modeled as condi- 
tions of all conditional effects. We are now ready to formally 
specify the resulting domain V and problem V'. 

For each action a <E A, we introduce new propositions 
pPJ'^, q'^'^, and their negations npPJ^, nq'^'^, nr'j;^^ for 

each p G Pre{a), q G Add{a) and r G Del{a) to deter- 
mine whether they are realized as preconditions and effects 
of a in the real domainQ Let Fnew be the set of those new 
propositions, then F' — F iJ Fnew is the proposition set of 

v. 



"These propositions are introduced once, and re-used for all ac- 
tions sharing the same schema with a. 



Each action a' e A! is made from one action a € 
A such that Pre{a') = 0, and E{a') consists of 
2\P^e{a)\+\Addia)\+\D^i{a)\ conditional effects e. For each 
conditional effect e: 

• cons(e) is the union of the following sets: 

- the certain preconditions Pre{a), 

- the set of possible preconditions of a that are realized, 
and hidden propositions representing their realization: 

Pr£(a) U {priP e TVi(a)} U {upPJ^lp E P^e{a) \ 

- the set of hidden propositions corresponding to the real- 
ization of possible add (delete) effects of a: {qa'^^^\q G 
Add(a)} U EAdd{a) \ ^dd{a)} ({rf '|r G 
i5e7(a)}U{nrf<='|r G DeZ(a) \Z3e7(a)}, respectively); 

• the single outcome e of e is defined as add(e) = Add{a)\J 
Add{a), del{e) = Del{a)\J~Dd{a), and Pr{e) = 1, 

where Pre{a) C Pre{a), Add{a) C Add{a) and Del{a) C 

Del{a) represent the sets of realized preconditions and ef- 
fects of the action. In other words, we create a conditional 
effect for each subset of the union of the possible precondi- 
tion and effect sets of the action a. Note that the inclusion of 
new propositions derived from Pre{a), Add{a), Del{a) and 

their "complement" sets Pre{a)\Pre{a), Add{a)\Add{a), 

Del{a) \ Del{a) makes all condition sets of the action a' 
mutually exclusive. As for other cases (including those in 
which some precondition in Pre (a) is excluded), the action 
has no effect on the resulting state, they can be ignored. The 
condition sets, therefore, are also exhaustive. 

The initial belief state 6/ consists of 21^"=™' states s' C F' 
such that p G s' iff p G / (Vp G F), each represents a com- 
plete domain model Vi G {{V)) and with the probability 
Pr(2?i). The goal is G' = G, and the acceptable goal satis- 
faction probability is p' ~ p. 

Theorem 1. Given a plan tt — (ai, an) for the problem 

V, and tt' — (a'l, a^) where a';, is the compiled version 

of ak (I < k < n) in T". Then R{'k, V) > p iffT^' achieves 
all goals with at least p probability in V' . 

Proof (sketch). According to the compilation, there is one- 
to-one mapping between each complete model T>i G ((2?)) 
in V and a (complete) state s^q G 6/ in V' . Moreover, if Vi 
has a probability of Pr(2?i) to be the real model, then s^q 
also has a probability of Pr(Pi) in the belief state hi of V' . 

Given our projection over complete model Vi, executing 
TT from the state / with respect to Vi results in a sequence 
of complete state (sii, s,;(„+i)). On the other hand, exe- 
cuting tt' from {s[q] in V' results in a sequence of belief 
states ({s-i},...,{s^(„+i)}). With the note that p G 
iff p G / (ip G F), by induction it can be shown that 
p G s-^- iffp G s,j (Vj G {!,..., n + G F). There- 
fore, s,(„+i) h G iff s^(„_^^) h G = G". 

Since all actions a' are deterministic and s^q has a proba- 
bility of Pr(2?i) in the behef state hi of V' , the probability 



pick-up 

:parameters {?b - ball ?r - room) 
: precondition 

(and {at ?b ?r) (at-robot ?r) (free-gripper) ) 
: possible_pre condition 

(and (light ?b) ) 
: effect 

(and (carry ?b) (not (at ?b ?r) ) (not (free-gripper) ) ) 
; possible_ef f ect 

(and (dirty ?b) ) 

pick-up 

:parameters (?b - ball ?r - room) 
: precondition (and ) 
:effect (and 

(when (and (at ?b ?r) (at-robot ?r) (free-gripper) 
(light ?b) ( p'",t-,.p) i q°pL-up ) ) 
(and (carry ?b) (not (at ?b ?r) ) (not (free-gripper)) 
(dirty ?b) } ) 

(when (and (at ?b ?r) (at-robot ?r) (free-gripper) 

pre add 

(light ?b) iP pick-up) inq pick-up)) 
(and (carry ?b) (not (at ?b ?r) ) (not (free-gripper) ) ) ) 
(when (and (at ?b ?r) (at-robot ?r) (free-gripper) 

„/"■£' add 

(nPpi,t-„p) i1 rick-up )) 
(and (carry ?b) (not (at ?b ?r) ) (not (free-gripper)) 
(dirty ?b) ) ) 

(when (and (at ?b ?r) (at-robot ?r) (free-gripper) 

^"p'pZk-up^ ^"9pL-up )) 

(and (carry ?b) (not (at ?b ?r) ) (not (free-gripper) ) ) ) ) 



Figure 2: An example of compiling the action pick-up in an 
incomplete domain model (top) into CPP domain (bottom). 
The hidden propositions Pp[4-«p' Ipfck-up their nega- 
tions can be interpreted as whether the action requires light 
balls and makes balls dirty. Newly introduced and relevant 
propositions are marked in bold. 

that tt' achieves G" is V , i^'PrCVi), which is equal 

i(n + l)|— ^ 

to -R(7r, V) as defined in Equation|2] This proves the theo- 
rem. □ 

Example: Consider the action pick-up(?b - ball, ?r - room) 
in the Gripper domain as described above. In addition to the 
possible precondition (light ?b) on the weight of the ball ?b, 
we also assume that since the modeler is unsure if the gripper 
has been cleaned or not, she models it with a possible add 
effect (dirty ?b) indicating that the action might make the 
ball dirty. Figure|2]shows both the original and the compiled 
specification of the action. 

6. Experimental Results 

We tested the compilation with Probabilistic-FF (PFF), a 
state-of-the-art planner, on a range of domains in the Inter- 
national Planning Competition. We first discuss the results 
on the variants of the Logistics and Satellite domains, where 
domain incompleteness is deliberately modeled on the pre- 
conditions and effects of actions (respectively). Our purpose 
here is to observe how generated plans are robustified to sat- 
isfy a given robustness threshold, and how the amount of 
incompleteness in the domains affects the plan generation 
phase. We then describe the second experimental setting 



in which we randomly introduce incompleteness into IPC 
domains, and discuss the feasibility of our approach in this 
setting 3 

Domains with deliberate incompleteness 

Logistics: In this domain, each of the two cities Ci and 
C2 has an airport and a downtown area. The transporta- 
tion between the two distant cities can only be done by 
two airplanes Ai and ^2- In the downtown area of Ci 
(i G {1, 2}), there are three heavy containers Pn , that 
can be moved to the airport by a truck Tj. Loading those 
containers onto the truck in the city Ci, however, requires 
moving a team of m robots Rn, Rim {m > 1), initially 
located in the airport, to the downtown area. The source 
of incompleteness in this domain comes from the assump- 
tion that each pair of robots Rij and i?2j (1 < j < "i) are 
made by the same manufacturer Mj, both therefore might 
fail to load a heavy container^ The actions loading contain- 
ers onto trucks using robots made by a particular manufac- 
turer (e.g., the action schema load-tmck-with-robots-of-Ml 
using robots of manufacturer Mi), therefore, have a possible 
precondition requiring that containers should not be heavy. 
To simplify discussion (see below), we assume that robots 
of different manufacturers may fail to load heavy contain- 
ers, though independently, with the same probability of 0.7. 
The goal is to transport all three containers in the city Ci to 
C2, and vice versa. For this domain, a plan to ship a con- 
tainer to another city involves a step of loading it onto the 
truck, which can be done by a robot (after moving it from 
the airport to the downtown). Plans can be made more ro- 
bust by using additional robots of different manufacturer af- 
ter moving them into the downtown areas, with the cost of 
increasing plan length. 

Satellite: In this domain, there are two satellites Si and 6*2 
orbiting the planet Earth, on each of which there are m in- 
struments Lii, Lim (i e {1; 2}, TO > 1) used to take im- 
ages of interested modes at some direction in the space. For 
each j e {1, ...,m}, the lenses of instruments Lij's were 
made from a type of material Mj, which might have an er- 
ror affecting the quality of images that they take. If the ma- 
terial Mj actually has error, all instruments Lij's produce 
mangled images. The knowledge of this incompleteness is 
modeled as a possible add effect of the action taking images 
using instruments made from Mj (for instance, the action 
schema take-image-with-instruments-Ml using instruments 
of type All) with a probability of pj, asserting that images 
taken might be in a bad condition. A typical plan to take 
an image using an instrument, e.g. L14 of type A/4 on the 
satellite Si, is first to switch on L14, turning the satellite 5*1 
to a ground direction from which L14 can be calibrated, and 
then taking image. Plans can be made more robust by using 
additional instruments, which might be on a different satel- 

^The experiments were conducted using an Intel Core2 Duo 
3.16GHz machine with 4Gb of RAM, and the time limit is 15 min- 
utes. 

''The uncorrelated incompleteness assumption applies for pos- 
sible preconditions of action schemas specified for different manu- 
facturers. It should not be confused here that robots Rij and 7?2j 
of the same manufacturer Mj can independently have fault. 



p 


m — 1 


m = 2 


m — 3 


m — 4 


m — 5 


0.1 


32/10.9 


36/26.2 


40/57.8 


44/121.8 


48/245.6 


0.2 


32/10.9 


36/25.9 


40/57.8 


44/121.8 


48/245.6 


0.3 


32/10.9 


36/26.2 


40/57.7 


44/122.2 


48/245.6 


0.4 




42/42.1 


50/107.9 


58/252.8 


66/551.4 


0.5 


± 


42/42.0 


50/107.9 


58/253.1 


66/551.1 


0.6 






50/108.2 


58/252.8 


66/551.1 


0.7 






_L 


58/253.1 


66/551.6 


0.8 






_L 




66/550.9 


0.9 






_L 


± 





Table 1 : The results of generating robust plans in Logistics 
domain (see text). 
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Table 2: The results of generating robust plans in Satellite 
domain (see text). 



lite, but should be of different type of materials and can also 
take an image of the interested mode at the same direction. 

Table[T]and|2]shows respectively the results in the Logis- 
tics and Satellite domains with p e {0.1, 0.2, 0.9} and 
m = {1,2,..., 5}. The number of complete domain models 
in the two domains is 2™. For Satellite domain, the probabil- 
ities pj's range from 0.25, 0.3,... to 0.45 when to increases 
from 1, 2, ... to 5. For each specific value of p and m, we 
report / /t where / is the length of plan and t is the running 
time (in seconds). Cases in which no plan is found within the 
time limit are denoted by and those where it is provable 
that no plan with the desired robustness exists are denoted 
by 

Observations on fixed value of m: In both domains, for a 
fixed value of to we observe that the solution plans tend to 
be longer with higher robustness threshold p, and the time 
to synthesize plans is also larger For instance, in Logistics 
with m — 5, the plan returned has 48 actions if p — 0.3, 
whereas 66-length plan is needed if p increases to 0.4. Since 
loading containers using the same robot multiple times does 
not increase the chance of success, more robots of differ- 
ent manufacturers need to move into the downtown area for 
loading containers, which causes an increase in plan length. 
In the Satellite domain with to — 3, similarly, the returned 
plan has 37 actions when p — 0.5, but requires 53 actions 
if p = 0.6 — more actions need to calibrate an instrument 
of different material types in order to increase the chance of 
having a good image of interested mode at the same direc- 
tion. 

Since the cost of actions is currently ignored in the compi- 
lation approach, we also observe that more than the needed 
number of actions have been used in many solution plans. In 



the Logistics domain, specifically, it is easy to see that the 
probability of successfully loading a container onto a truck 
using robots of fc (1 < A; < m) different manufacturers is 
(1 — 0.7*^). As an example, however, robots of all five man- 
ufacturers are used in a plan when p = 0.4, whereas using 
those of three manufacturers is enough. 

Observations on fixed value of p: In both domains, we 
observe that the maximal robustness value of plans that can 
be returned increases with higher number of manufacturers 
(though the higher the value of m is, the higher number of 
complete models is). For instance, when m = 2 there is not 
any plan returned with at least p = 0.6 in the Logistics do- 
main, and with p = 0.4 in the Satellite domain. Intuitively, 
more robots of different manufacturers offer higher prob- 
ability of successfully loading a container in the Logistics 
domain (and similarly for instruments of different materials 
in the SatelUte domain). 

Finally, it may take longer time to synthesize plans with 
the same length when m is higher — in other words, the in- 
creasing amount of incompleteness of the domain makes the 
plan generation phase harder. As an example, in the Satellite 
domain, with p = 0.6 it takes 216.7 seconds to synthesize 
a 37-length plan when there are m = 5 possible add effects 
at the schema level of the domain, whereas the search time 
is only 94.1 seconds when m = 4. With p = 0.7, no plan 
is found within the time limit when m = 5, although a plan 
with robustness of 0.7075 exists in the solution space. It is 
the increase of the branching factors and the time spent on 
satisfiability test and weighted model-counting used inside 
the planner that affect the search efficiency. 

Domains with random incompleteness 

We built a program to generate an incomplete domain model 
from a deterministic one by introducing M new propositions 
into each domain (all are initially T). Some of those new 
propositions were randomly added into the sets of possible 
preconditions/effects of actions. Some of them were also 
randomly made certain add/delete effects of actions. With 
this strategy, each solution plan in an original deterministic 
domain is also a valid plan, as defined earlier, in the cor- 
responding incomplete domain. Our experiments with the 
Depots, Driverlog, Satellite and ZenoTravel domains indi- 
cate that because the annotations are random, there are often 
fewer opportunities for the PFF planner to increase the ro- 
bustness of a plan prefix during the search. This makes it 
hard to generate plans with a desired level of robustness un- 
der given time constraint. 

In summary, our experiments on the two settings above 
suggest that the compilation approach based on the PFF 
planner would be a reasonable method for generating robust 
plans in domains and problems where there are chances for 
robustifying existing action sequences in the search space. 

7. Conclusion and Future Work 

In this paper, we motivated the need for synthesizing robust 
plans under incomplete domain models. We introduced an- 
notations for expressing domain incompleteness, formalized 
the notion of plan robustness, and showed an approach to 
compile the problem of generating robust plans into confor- 



mant probabilistic planning. We presented empirical results 
showing the promise of our approach. For future work, we 
are developing a planning approach that directly takes the 
incompleteness annotations into account during the search, 
and compare it with our current compilation method. We 
also plan to consider the problem of robustifying a given 
plan subject to a provided cost bound. 
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